Abstract

The problem of branched transportation aims to describe the movement of masses when,
due to concavity effects, they have an interest in travelling together as much as possible, because
the cost for a path of length $\ell$ covered by a mass $m$ is proportional to $m^\alpha \ell$ with $0 < \alpha < 1$.
The optimization of this criterion lets branched structures appear and is suitable for applications
such as road systems, blood vessels and river networks. Several models have been employed in the
literature to present this transport problem, and the present paper looks at a dynamical one,
similar to the celebrated Benamou-Brenier formulation of Kantorovich optimal transport. The
movement is represented by a path $\rho_t$ of probabilities, connecting an initial state $\mu_0$ to a final
state $\mu_1$, satisfying the continuity equation $\partial_t \rho + \mathrm{div}_x q = 0$ together with a velocity field $v$
(with $q = \rho v$ being the momentum). The transportation cost to be minimized is non-convex
and finite on atomic measures: $\int_0^1 \left( \int_\Omega \rho^{\alpha-1} |q| \, d\#(x) \right) dt$.

1 Introduction

The optimal mass transportation theory consists in the study of transporting a given mass distribution
$\mu_0$ on $\Omega$ (which we assume to be a compact and convex subset of $\mathbb{R}^d$) into a final configuration
$\mu_1$, by minimizing the total transportation cost, the latter being suitably defined: clearly, $\mu_0$ and $\mu_1$
are required to satisfy the mass balance condition $\int_\Omega d\mu_0 = \int_\Omega d\mu_1$. From now on, we will assume
that they are normalized to be probability measures. The cost for moving a unit mass from a
position $x$ to a position $y$ is taken equal to $c(x,y)$, a function given a priori, which determines the
nature of the problem and provides the total minimal cost

$$C(\mu_0,\mu_1) = \min\left\{ \int_{\Omega\times\Omega} c(x,y)\, d\gamma(x,y) \,:\, \gamma \in \Gamma(\mu_0,\mu_1) \right\}, \tag{1.1}$$

where $\Gamma(\mu_0,\mu_1)$ is the class of admissible transport plans, i.e. probabilities on the product space
$\Omega\times\Omega$ having first and second marginals given by $\mu_0$ and $\mu_1$ respectively. The cases $c(x,y) = |x-y|^p$
with $p \ge 1$ have been particularly studied, and the cost $C(\mu_0,\mu_1)$ in (1.1) provides, through the
relation

$$W_p(\mu_0,\mu_1) = C(\mu_0,\mu_1)^{1/p},$$

the so-called Wasserstein distance $W_p$, which metrizes the weak* convergence on the space of
probabilities $\mathcal{P}(\Omega)$. A very wide literature on the subject is available; we simply mention the books
[2], [26], [25], where one can find a complete list of references.
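As a concrete illustration of the distance $W_p$, the following minimal sketch computes it in the simplest possible setting. It assumes (this is our simplification, not a setting used in the paper) two measures on the real line, each made of the same number of atoms with equal weights; in that case the optimal plan for the convex cost $|x-y|^p$ pairs the sorted atoms. The function name `wasserstein_1d` is hypothetical.

```python
def wasserstein_1d(xs, ys, p=2.0):
    """W_p between two uniform empirical measures on the line.

    Assumption: both measures have the same number of atoms, each atom
    carrying mass 1/n. The monotone (sorted) pairing is then optimal
    for the cost |x - y|**p with p >= 1.
    """
    xs, ys = sorted(xs), sorted(ys)
    if len(xs) != len(ys):
        raise ValueError("this sketch needs the same number of atoms on both sides")
    n = len(xs)
    # Total cost C(mu0, mu1) of the sorted pairing, then the 1/p power.
    total_cost = sum(abs(x - y) ** p for x, y in zip(xs, ys)) / n
    return total_cost ** (1.0 / p)
```

For instance, `wasserstein_1d([0, 0], [1, 3], p=1)` averages the two displacements 1 and 3, and increasing `p` can only increase the value, in agreement with the ordering $W_1 \le W_p$ on probability measures.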

Thanks to the fact that the space $\mathbb{W}_p(\Omega)$ of probability measures endowed with these distances
turns out to be a geodesic space, dynamical models for optimal transportation are of particular
interest. Being a geodesic space means that the distance between two points is always equal to the
infimum of the lengths of the curves connecting these points, and that this infimum is actually a
minimum:

$$W_p(\mu_0,\mu_1) = \min\left\{ \int_0^1 |\rho'_t|_{W_p}\, dt \,:\, \rho \in \mathrm{Lip}([0,1]; \mathbb{W}_p(\Omega)),\ \rho_0 = \mu_0,\ \rho_1 = \mu_1 \right\},$$

where $|\rho'_t|_{W_p}$ is the metric derivative of the measure-valued Lipschitz curve $\rho$, defined as (we refer
the reader to [2] for more details)

$$|\rho'_t|_{W_p} = \lim_{h\to 0} \frac{W_p(\rho_{t+h},\rho_t)}{|h|}.$$

Since the curves connecting two points of this space are actually curves of measures, they can
be described through the so-called continuity equation: it is well-known (see [2], Theorem 8.3.1)
that for every Lipschitz or absolutely continuous curve $\rho_t$ in the space $\mathbb{W}_p(\Omega)$ ($p > 1$ for simplicity)
there exists a map $q$ from $[0,1]$ into the space of vector-valued measures, such that $q_t \ll \rho_t$ (hence
$q_t = v_t \cdot \rho_t$, $v$ being the velocity vector), which represents the flux $q = \rho v$ and satisfies

$$\partial_t \rho + \mathrm{div}_x q = 0 \quad\text{and}\quad \|v_t\|_{L^p(\rho_t)} = |\rho'_t|_{W_p} \tag{1.2}$$

(the degenerate case $p = 1$ being a little more involved, since $q_t \ll \rho_t$ is no longer guaranteed
and the $L^p$ norm has to be replaced by the mass of the measure $q_t$, see [1]).

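On the other hand, every time we have a pair $(\rho,q)$ satisfying $\partial_t \rho + \mathrm{div}_x q = 0$ with $q \ll \rho$,
so that $q_t = v_t \cdot \rho_t$, we can infer that $|\rho'_t|_{W_p} \le \|v_t\|_{L^p(\rho_t)}$. This means that one can minimize the
functional

$$\mathcal{A}_p(\rho,q) = \begin{cases} \displaystyle\int_0^1 \left( \int_\Omega |v_t|^p\, d\rho_t \right) dt & \text{if } q \ll \rho \text{ and } q_t = v_t \cdot \rho_t, \\ +\infty & \text{otherwise,} \end{cases} \tag{1.3}$$

which is nothing but the integral in time of the kinetic energy when $p = 2$, and the cost in (1.1)
can be recovered through the equality

$$C(\mu_0,\mu_1) = \min\left\{ \int_0^1 \int_\Omega |v_t(x)|^p\, d\rho_t(x)\, dt \,:\, \partial_t \rho + \mathrm{div}_x q = 0,\ \rho_0 = \mu_0,\ \rho_1 = \mu_1 \right\}.$$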

The problem above is the one which was proposed by Benamou and Brenier in [3] as a dynamical 
version of optimal transportation. It has the advantage that it is the minimization of a convex 
functional of p and q, under linear constraints. 

Other variants of mass transportation problems have been studied and can be expressed in this
way by considering in (1.3) other convex functions of the pair $(\rho,q)$. Recently, Dolbeault, Nazaret
and Savaré introduced in [17] new classes of distances over $\mathcal{P}(\mathbb{R}^d)$ based on the minimization of
the functional (where $\lambda$ is a given reference measure on $\mathbb{R}^d$ and $\rho$ and $q$ are identified with their
densities w.r.t. $\lambda$)

which are connected to the non-linear mobility continuity equation $\partial_t \rho + \mathrm{div}_x (h(\rho) v) = 0$ (a
treatment of the limiting case $h(\rho) = 1$, corresponding to an integrand of the form $|q|^p$, can also be
found in [13]). If the function $h$ is concave (for example $h(\rho) = \rho^\beta$, with $\beta \in [0,1]$), this problem
turns out to be convex as well. The main interest that motivated Dolbeault et al. to study
these distances lies in the possible applications to diffusion equations of the type of the non-linear
mobility continuity equation $\partial_t \rho + \mathrm{div}_x (h(\rho) v) = 0$ above, where the vector field $v$ depends on $\rho$ in
such a way that the equation can be interpreted as a gradient flow of a given functional with respect
to this new family of dynamical distances. Moreover, the equations of the geodesics (which are
similar to a mean-field game system, see [H]) and the conditions for these distances to be finite are
being studied in [21].

In connection with congestion effects and crowd motion, other models include penalizations on 
high densities: in [14] the case

Hp,q) = ^ + cp^ P>hc>0 
pP ^ 

has been considered as a model for crowd motion in a congested situation (for instance in case of 
panic). This problem as well is convex. 

A completely different situation occurs in the case opposite to congestion, when concentration
effects are present and the mass has an interest in travelling together as much as possible, in order
to save part of the cost. This happens very often in applications, as observed by Gilbert,
who in [18] formulated a mathematical model for the transportation of signals along telephone
cables. More recently, Gilbert's model has been refined and considered in the framework of
mass transportation, under the name of branched transport, to emphasize the fact that transport
rays may bifurcate. All these models have in common the fact that the cost for a mass $m$ moving
on a path of length $\ell$ is proportional to $m^\alpha \ell$ ($0 < \alpha < 1$, so that $(m_1 + m_2)^\alpha < m_1^\alpha + m_2^\alpha$). In
[H E El El El] for every $0 < \alpha < 1$ a transportation cost from $\mu_0$ to $\mu_1$ is considered through a
suitable use of probabilities defined on spaces of curves in $\Omega$, with [21] (the so-called irrigation
patterns model) dealing with the case of a single source $\mu_0 = \delta_{x_0}$. See Section 4 for a glance
at the details of these models and their formulations. On the other hand, the model of [27] can
be seen as the natural extension of Gilbert's original model and uses vector measures having
prescribed divergence $\mu_0 - \mu_1$: these vector measures are the continuous generalization of the finite
weighted and oriented graphs that were present in Gilbert's original formulation.
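The effect of the strict sub-additivity of $m \mapsto m^\alpha$ can be seen on a tiny example. The sketch below is only an illustration under assumptions of our own: a unit mass at the origin is sent to two targets of mass 1/2 at $(1, \pm h)$, either along two separate segments (a "V" shape) or along a common trunk up to a hypothetical branching point $(b, 0)$ followed by two short branches (a "Y" shape). The helper names and the chosen numbers are not taken from the paper.

```python
import math

def gilbert_cost(edges, alpha):
    """Sum of m**alpha * length over edges given as (mass, start, end)."""
    return sum(m ** alpha * math.dist(a, b) for m, a, b in edges)

def v_shape_cost(h, alpha):
    # Two independent segments, each carrying mass 1/2 from the source.
    src = (0.0, 0.0)
    return gilbert_cost([(0.5, src, (1.0, h)), (0.5, src, (1.0, -h))], alpha)

def y_shape_cost(h, b, alpha):
    # A common trunk carrying the whole mass up to (b, 0), then two branches.
    src, node = (0.0, 0.0), (b, 0.0)
    edges = [(1.0, src, node), (0.5, node, (1.0, h)), (0.5, node, (1.0, -h))]
    return gilbert_cost(edges, alpha)
```

With $\alpha = 1/2$, $h = 0.2$ and $b = 0.8$ the branched configuration is cheaper than the two straight segments, while for $\alpha = 1$ (the linear cost of classical transport) the straight segments win, as expected from the triangle inequality.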

A first attempt to obtain a dynamical formulation of branched transportation through curves 
of measures was made in [10], and later refined in [IH I12j : in these papers the starting point 
is the geodesic formulation of the Wasserstein distance, where the length functional is modified 
considering an energy of the type 

$$\int_0^1 g(\rho_t)\, |\rho'_t|_{W_p}\, dt.$$

The weight function $g$ is a local term of the moving mass, forcing the mass to concentrate and thus
giving rise to branching phenomena.

These models are not yet satisfactory, because they are in general not equivalent to those by
Gilbert, Xia or Bernot-Caselles-Morel. An attempt to modify the functionals
defined on curves of measures so as to obtain equivalence with the other models has been made
in [11], where on the other hand a quite involved distinction between moving mass and still
mass has to be made. In all of these models, branched transportation is studied avoiding the
Benamou-Brenier approach, which consists in the minimization of a suitable cost $\mathcal{F}(\rho,q)$ under the
constraint of the continuity equation $\partial_t \rho + \mathrm{div}_x q = 0$ and which we believe is the most natural for this
kind of problem. The only approach to dynamical branched transportation using the continuity
equation is, as far as we know, the one of [9]. Yet, to prove semicontinuity and hence existence,
even in this model the problem is reduced to the minimization of a functional of the form

$$\int \theta^\alpha\, d\mathcal{H}^1(x,t)$$

(which is the energy of Xia in [27]), and the dynamical features are not completely exploited.

In the present paper we follow a more direct approach: for all pairs $(\rho,q)$ verifying the continuity
equation, with $\rho_0 = \mu_0$ and $\rho_1 = \mu_1$, we define a functional $\mathcal{F}(\rho,q)$ and we show that this functional
is both lower semicontinuous and coercive with respect to a suitable convergence on $(\rho,q)$, and this
directly provides the existence of an optimal dynamical path. The paper is organized as follows:

• in Section 2 we give the precise setting and state the main results;

• Section 3 is devoted to the proofs giving the existence of an optimal path $\rho_t$;

• in Section 4 we show that our model is equivalent to the other models of branched transportation
available in the literature, comparing it to the traffic plan model of [4], which is
one of the most flexible (and anyway equivalent to the others, as shown in [6], Chapter 9);

• in the Appendix we deal with some inequalities involving Wasserstein distances and branched
distances, that is, distances over the space of probabilities given by the minima of some
branched transportation problems. These inequalities have already been studied in [23] and
[16], but some very precise issues concerning $d_\alpha$ and $W_{1/\alpha}$ are very close to the topics of this
paper and deserve to be treated here. New and simpler proofs are provided.



2 Problem setting and main results 

In this section we fix the notation and state the main results of the paper. In the following, $\Omega$ will
denote a given subset of $\mathbb{R}^d$, where all the mass dynamics will take place; for the sake of simplicity
we assume that $\Omega$ is convex and compact. The space $\mathcal{P}(\Omega)$ of all Borel probabilities on $\Omega$ can then
be endowed with the weak* convergence, which is metrized by the Wasserstein distances (see the
Introduction). In the following, we will also use the notation $\mathcal{M}(\Omega;\mathbb{R}^d)$ to indicate the space of
$\mathbb{R}^d$-valued Radon measures over $\Omega$, while $\mathcal{L}^k$ will indicate the $k$-dimensional Lebesgue measure.

The main objects to be considered will be pairs $(\rho,q)$ with

$$\rho \in C([0,1]; \mathcal{P}(\Omega)), \qquad q \in L^1([0,1]; \mathcal{M}(\Omega;\mathbb{R}^d)) \tag{2.1}$$

satisfying the continuity equation formally written as (here $\nu$ stands for the outer unit normal to
$\partial\Omega$)

$$\begin{cases} \partial_t \rho + \mathrm{div}_x q = 0 & \text{in } [0,1]\times\Omega, \\ q \cdot \nu = 0 & \text{on } [0,1]\times\partial\Omega, \end{cases} \tag{2.2}$$

whose precise meaning is given in the sense of distributions, that is

$$\int_0^1 \left[ \int_\Omega \partial_t \phi(t,x)\, d\rho_t(x) + \int_\Omega D_x \phi(t,x) \cdot dq_t(x) \right] dt = 0 \tag{2.3}$$

for every smooth function $\phi$ with $\phi(0,x) = \phi(1,x) = 0$.

Definition 2.1. We denote by $\mathfrak{D}$ the set of all pairs $(\rho,q)$ satisfying (2.1) and (2.3). Moreover,
given $\mu_0,\mu_1 \in \mathcal{P}(\Omega)$, we define the set $\mathfrak{D}(\mu_0,\mu_1)$ of admissible configurations connecting $\mu_0$ to $\mu_1$
as

$$\mathfrak{D}(\mu_0,\mu_1) = \{ (\rho,q) \in \mathfrak{D} \,:\, \rho_0 = \mu_0,\ \rho_1 = \mu_1 \}.$$

The velocity vector $v$ can be defined as the Radon-Nikodym derivative of the vector measure $q$
with respect to $\rho$:

$$v = \frac{dq}{d\rho}.$$

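Among all pairs $(\rho,q) \in \mathfrak{D}$ satisfying the continuity equation above, we consider a cost function
$\mathcal{F}(\rho,q)$ of the form

$$\mathcal{F}(\rho,q) = \int_0^1 F(\rho_t,q_t)\, dt, \tag{2.4}$$

where $F$ is defined through

$$F(\rho,q) = \begin{cases} G_\alpha(|v|^{1/\alpha} \cdot \rho) & \text{if } q = v \cdot \rho, \\ +\infty & \text{if $q$ is not absolutely continuous w.r.t. $\rho$,} \end{cases}$$

and $G_\alpha$ ($0 < \alpha < 1$) is a functional defined on measures, of the kind studied by Bouchitté and
Buttazzo in [5]: $G_\alpha(\lambda) = +\infty$ if $\lambda$ is not purely atomic, while (# stands for the counting measure)

$$G_\alpha(\lambda) = \int_\Omega |\lambda(\{x\})|^\alpha\, d\#(x) = \sum_i |\lambda_i|^\alpha \quad \text{if } \lambda = \sum_i \lambda_i \delta_{x_i}.$$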

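In this way our functional $\mathcal{F}$ becomes

$$\mathcal{F}(\rho,q) = \int_0^1 \left( \int_\Omega |v_t(x)|\, \rho_t(\{x\})^\alpha\, d\#(x) \right) dt = \int_0^1 \left( \sum_i |v_{t,i}|\, \rho_{t,i}^\alpha \right) dt, \qquad (\rho,q) \in \mathfrak{D},$$

and the dynamical model for branched transport we consider is

$$\mathcal{B}_\alpha(\mu_0,\mu_1) := \min_{(\rho,q) \in \mathfrak{D}(\mu_0,\mu_1)} \mathcal{F}(\rho,q). \tag{2.5}$$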
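For a purely atomic configuration at a fixed time, the integrand of the functional can be evaluated by hand. The following minimal sketch, with hypothetical helper names `G_alpha` and `F_atomic`, computes $G_\alpha$ on a list of atom masses and checks on an example that $G_\alpha(|v|^{1/\alpha}\cdot\rho)$ coincides with $\sum_i |v_i|\,\rho_i^\alpha$. It assumes the measure is given directly as a finite list of masses $\rho_i$ with scalar speeds $|v_i|$, which is a simplification of the general setting.

```python
def G_alpha(masses, alpha):
    """G_alpha of a finitely atomic measure given by the masses of its atoms."""
    return sum(abs(m) ** alpha for m in masses if m != 0)

def F_atomic(rho, speeds, alpha):
    """Integrand F(rho, q) for q = v * rho, with rho atomic: sum_i |v_i| rho_i**alpha."""
    return sum(abs(v) * r ** alpha for r, v in zip(rho, speeds))

def F_via_G(rho, speeds, alpha):
    """Same quantity, computed as G_alpha(|v|**(1/alpha) * rho)."""
    weighted = [abs(v) ** (1.0 / alpha) * r for r, v in zip(rho, speeds)]
    return G_alpha(weighted, alpha)
```

Splitting a unit atom into two halves raises `G_alpha` from 1 to $2^{1-\alpha}$, which is the concavity effect that favours concentrated mass.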



Our main goal is to show that the minimization problem (2.5) above admits a solution. This
will be obtained through the direct methods of the calculus of variations, which consist in proving
lower semicontinuity and coercivity of the problem under consideration, with respect to a suitable
convergence.

Remark 2.2. We point out that the weak* convergence of the pairs $(\rho,q)$ does not directly imply
the lower semicontinuity in (2.5), since the functional is not jointly convex. On the other hand, if
$(\rho^n,q^n) \in \mathfrak{D}$ and we assume

$$(\rho^n_t, q^n_t) \rightharpoonup (\rho_t, q_t) \quad \text{for $\mathcal{L}^1$-a.e. } t \in [0,1],$$

then a simple application of Fatou's Lemma would lead to the desired semicontinuity property of $\mathcal{F}$
(because one could prove that $F$ is a lower semicontinuous functional on measures, as a consequence
of the semicontinuity of $G_\alpha$ and of the convexity of $(x,y) \mapsto |x|^p / y^{p-1}$).

In order to prove a semicontinuity result in the easiest possible way, we will introduce a convergence
which is stronger than the weak convergence of measures on $[0,1]\times\Omega$, but weaker than weak
convergence for every fixed time $t$. This convergence will be compatible with the compactness we
can infer from our variational problem.

Definition 2.3. We say that a sequence $(\rho^n,q^n)$ $\tau$-converges to $(\rho,q)$ if $(\rho^n,q^n) \rightharpoonup (\rho,q)$ in the
sense of measures and

$$\sup_{n\in\mathbb{N},\ t\in[0,1]} F(\rho^n_t,q^n_t) < +\infty.$$

Theorem 2.4. Let $(\rho^n,q^n)$ be a sequence such that $\mathcal{F}(\rho^n,q^n) \le C$. Then, up to a time
reparametrization, $(\rho^n,q^n)$ is $\tau$-compact.

Theorem 2.5. Let $(\rho^n,q^n) \in \mathfrak{D}$ be a sequence $\tau$-converging to $(\rho,q)$. Then

$$\mathcal{F}(\rho,q) \le \liminf_{n\to\infty} \mathcal{F}(\rho^n,q^n).$$

As a consequence we obtain the following existence result.

Theorem 2.6. For every $\mu_0,\mu_1 \in \mathcal{P}(\Omega)$, the minimization problem (2.5) admits a solution.

Remark 2.7. We remark that, for some choices of the data $\mu_0,\mu_1$ and of the exponent $\alpha$, the
statement of Theorem 2.6 could be empty, because the functional $\mathcal{F}$ could be constantly $+\infty$ on
every admissible path $(\rho,q)$ joining $\mu_0$ to $\mu_1$. This issue will be solved in Section 4, where the
equivalence with other variational models for branched transportation will be proven. Since for
these models finiteness of the minima has been widely investigated, we can infer for instance that
if $\alpha > 1 - 1/d$ then every pair $\mu_0$ and $\mu_1$ can be joined by a path of finite energy. On the other
hand, if $\alpha \le 1 - 1/d$, $\mu_0 = \delta_{x_0}$ and $\mu_1$ is absolutely continuous w.r.t. $\mathcal{L}^d$, then there are no finite
energy paths connecting them.

3 Proofs 

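A preliminary inequality to all the proofs is the following: if $q \ll \rho$, then $q_t = v_t \cdot \rho_t$ and

$$F(\rho_t,q_t) \ge \left( \int_\Omega |v_t|^{1/\alpha}\, d\rho_t \right)^\alpha = \|v_t\|_{L^{1/\alpha}(\rho_t)}, \tag{3.1}$$

due to the sub-additivity of the function $x \mapsto x^\alpha$. This inequality and its consequences will be
discussed in the Appendix as well. In particular it also follows that

$$F(\rho_t,q_t) \ge \|v_t\|_{L^1(\rho_t)} = |q_t|(\Omega). \tag{3.2}$$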
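The two lower bounds above are easy to test numerically on atomic measures. The sketch below is only a sanity check under our own assumptions (a probability made of finitely many atoms with masses $\rho_i$ and scalar speeds $v_i$); the function names are hypothetical. It returns the three quantities $\sum_i |v_i|\rho_i^\alpha$, $(\sum_i |v_i|^{1/\alpha}\rho_i)^\alpha$ and $\sum_i |v_i|\rho_i$, which should appear in decreasing order: the first comparison is the sub-additivity of $x \mapsto x^\alpha$, the second is the usual ordering of $L^p$ norms on a probability space.

```python
import random

def compare_bounds(rho, speeds, alpha):
    """Return (F, L^{1/alpha} norm of v, L^1 norm of v) for an atomic probability rho."""
    F = sum(abs(v) * r ** alpha for r, v in zip(rho, speeds))
    norm = sum(abs(v) ** (1.0 / alpha) * r for r, v in zip(rho, speeds)) ** alpha
    l1 = sum(abs(v) * r for r, v in zip(rho, speeds))
    return F, norm, l1

def random_probability(n, rng):
    """Random positive weights normalized to total mass 1."""
    w = [rng.random() + 1e-3 for _ in range(n)]
    total = sum(w)
    return [x / total for x in w]
```

When the measure is a single atom the three numbers coincide, which shows that neither inequality can be improved in general.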

3.1 Proof of Theorem 2.4

Due to the fact that the functional $\mathcal{F}$ is 1-homogeneous in the velocity, it is clear that
reparametrizations in time do not change the values of $\mathcal{F}$. By reparametrization, we mean replacing a pair $(\rho,q)$
with a new pair $(\tilde\rho,\tilde q)$ of the form $\tilde\rho_t = \rho_{\varphi(t)}$, $\tilde q_t = \varphi'(t)\, q_{\varphi(t)}$ (which equivalently means that $\tilde q$ is the
image measure of $q$ through the inverse of the map $(t,x) \mapsto (\varphi(t),x)$). Thanks to this invariance,
if $(\rho^n,q^n)$ is such that $\mathcal{F}(\rho^n,q^n) \le C$, then one can define a new pair $(\tilde\rho^n,\tilde q^n)$, with

$$F(\tilde\rho^n_t,\tilde q^n_t) = \mathcal{F}(\tilde\rho^n,\tilde q^n) = \mathcal{F}(\rho^n,q^n) \le C \quad \text{for every } t.$$
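The invariance under time reparametrization can be checked in two lines. The following is a short derivation sketch, under the assumption (ours, for simplicity) that $\varphi : [0,1] \to [0,1]$ is increasing, Lipschitz and onto; here $\tilde\rho_t = \rho_{\varphi(t)}$ and $\tilde q_t = \varphi'(t)\, q_{\varphi(t)}$ denote the reparametrized pair and $F$ is the 1-homogeneous integrand of the cost.

```latex
\begin{align*}
\tilde q_t = \varphi'(t)\, v_{\varphi(t)} \cdot \tilde\rho_t
  \quad &\Longrightarrow \quad
  F(\tilde\rho_t, \tilde q_t) = \varphi'(t)\, F(\rho_{\varphi(t)}, q_{\varphi(t)}), \\
\int_0^1 F(\tilde\rho_t, \tilde q_t)\, dt
  &= \int_0^1 F(\rho_{\varphi(t)}, q_{\varphi(t)})\, \varphi'(t)\, dt
   = \int_0^1 F(\rho_s, q_s)\, ds .
\end{align*}
```

The first line uses only that the velocity of the new pair is $\varphi'(t)\, v_{\varphi(t)}$ with $\varphi' \ge 0$; the second is the change of variable $s = \varphi(t)$. Choosing $\varphi$ so that $t \mapsto F(\tilde\rho_t,\tilde q_t)$ is constant gives a pair with integrand bounded uniformly in time.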

After that, we only need to prove compactness for the weak convergence of measures on $[0,1]\times\Omega$,
a fact which only requires bounds on the mass of $\rho^n$ and $q^n$. The bound on $\rho^n$ is straightforward,
since for every $t$ the measure $\rho^n_t$ is a probability, while for $q^n$, which is absolutely continuous w.r.t.
$\rho^n$, it is enough to use (3.2) in order to bound the mass of $q^n$ by $C$.

This allows us to extract a subsequence $(\rho^{n_k}, q^{n_k})$ which converges weakly to a pair $(\rho,q)$. The only
nontrivial point is that we a priori restricted our attention to pairs $(\rho,q)$ where $\rho \in C([0,1];\mathcal{P}(\Omega))$
and $q \in L^1([0,1];\mathcal{M}(\Omega;\mathbb{R}^d))$, so that we need to prove that $\rho$ is continuous and that $q$ is of the
form $\int q_t\, dt$. Yet, the inequality (3.1) applied to the pairs $(\rho^n,q^n)$ proves a uniform bound on the
$L^{1/\alpha}$ norm of the velocities, which implies that the curves $\rho^n$ are uniformly Lipschitz continuous
according to the distance $W_{1/\alpha}$, and this property is inherited by the limit measure $\rho$.

For the decomposition of $q$, just use the inequality (3.1), thus obtaining a uniform bound on
$\|v^n_t\|_{L^{1/\alpha}(\rho^n_t)}$, which a fortiori gives a uniform bound on the Benamou-Brenier functional

$$\mathcal{A}_{1/\alpha}(\rho^n,q^n) = \int_0^1 \int_\Omega |v^n_t|^{1/\alpha}\, d\rho^n_t\, dt.$$

This functional being lower semicontinuous, we can deduce the same bound at the limit: this in
particular implies that $q$ is absolutely continuous w.r.t. $\rho$, with an $L^{1/\alpha}$ density. Since $\rho$ is a measure
on $[0,1]\times\Omega$ which is of the form $\int \rho_t\, dt$, the same disintegration will be true for $q$.

This means that we have actually found an admissible pair $(\rho,q)$ which is the $\tau$-limit of
$(\rho^{n_k}, q^{n_k})$, and the proof is complete.

3.2 Proof of Theorem 2.5

We consider here a sequence $(\rho^n,q^n)$, where $q^n = v^n \cdot \rho^n$ (otherwise the functional $\mathcal{F}$ would not be
finite-valued), satisfying the continuity equation and such that $(\rho^n,q^n)$ $\tau$-converges to $(\rho,q)$.
First of all we define a sequence of measures $m^n$ on $[0,1]\times\Omega$ through

$$m^n = \int_0^1 \Big( \sum_i \rho^n_t(\{x_{i,t}\})^\alpha\, |v^n_t(x_{i,t})|\, \delta_{x_{i,t}} \Big)\, dt,$$

where the points $x_{i,t}$ are the atoms of $q^n_t$ (i.e. the atoms of $\rho^n_t$ where the velocity $v^n_t$ does not
vanish). We notice that $\mathcal{F}(\rho^n,q^n) = m^n([0,1]\times\Omega)$.

In order to prove lower semicontinuity of $\mathcal{F}$ we can assume $\mathcal{F}(\rho^n,q^n)$ to be bounded. This
bound implies the convergence $m^n \rightharpoonup m$, up to the extraction of a subsequence (not relabeled). It
is clear that, on this subsequence, we have

$$\lim_{n\to\infty} \mathcal{F}(\rho^n,q^n) = \lim_{n\to\infty} m^n([0,1]\times\Omega) = m([0,1]\times\Omega),$$

so that, in order to prove the desired semicontinuity property, it is enough to get some proper lower
bounds on $m$.

Notice that, since we have $m^n([a,b]\times\Omega) = \int_a^b F(\rho^n_t,q^n_t)\, dt$ and $F(\rho^n_t,q^n_t) \le C$ (by definition of
$\tau$-convergence), we know that the marginal of $m^n$ on the time variable is a measure with an $L^\infty$
density bounded by the constant $C$ on $[0,1]$. This bound is uniform and will then be satisfied by
the limit measure $m$ too: in particular, this implies that we can write $m = \int m_t\, dt$.
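Let us fix a closed set $Q \subset \Omega$, as well as a time interval $[a,b]$, and take the function

$$\chi_M(x) := (1 - M\, \mathrm{dist}(x,Q))_+, \qquad x \in \Omega,$$

where $(\cdot)_+$ stands for the positive part: observe that $\chi_M$ is nonnegative, takes the value 1 on $Q$, is
$M$-Lipschitz and vanishes outside a $1/M$-neighborhood of $Q$. Indicating by $1_S$ the characteristic
function of a generic set $S$, i.e. the function which takes the value 1 on $S$ and 0 elsewhere, we
consider $\varphi(t,x) = \chi_M(x)^\alpha\, 1_{[a,b]}(t)$, which is upper semicontinuous on $[0,1]\times\Omega$. Then we have

$$\int \varphi(t,x)\, dm(t,x) \ge \limsup_{n\to\infty} \int \varphi(t,x)\, dm^n(t,x) = \limsup_{n\to\infty} \int_a^b \Big( \sum_i \rho^n_t(\{x_i\})^\alpha\, |v^n_t(x_i)|\, \chi_M(x_i)^\alpha \Big)\, dt,$$

where the points $x_i$ are, as before, the atoms of $q^n_t$ (and we omitted the dependence on $n$ and $t$). We
then decompose the product $\rho_t(\{x_i\})^\alpha \chi_M(x_i)^\alpha$ as $(\rho_t(\{x_i\})\chi_M(x_i))^{\alpha-1} \cdot (\rho_t(\{x_i\})\chi_M(x_i))$ (where
$\rho_t \chi_M > 0$). Notice that $\rho^n_t(\{x_i\})\chi_M(x_i) \le \int \chi_M\, d\rho^n_t$. Since $\alpha - 1 < 0$, we can estimate the right-hand side in
the previous inequality as

$$\int_a^b \Big( \sum_i \rho^n_t(\{x_i\})^\alpha\, |v^n_t(x_i)|\, \chi_M(x_i)^\alpha \Big)\, dt \ge \int_a^b \Big( \int_\Omega \chi_M\, d\rho^n_t \Big)^{\alpha-1} \Big( \sum_i \rho^n_t(\{x_i\})\, |v^n_t(x_i)|\, \chi_M(x_i) \Big)\, dt.$$

We go on by estimating $\int \chi_M\, d\rho^n_t$ from above: we have

$$\int_\Omega \chi_M(x)\, d\rho^n_t(x) \le \int_\Omega \chi_M(x)\, d\rho^n_s(x) + M\, W_1(\rho^n_t,\rho^n_s),$$

which is a consequence of the definition of $W_1$ by duality with 1-Lipschitz functions (see [26],
Theorem 1.14). To estimate the $W_1$ distance we use $W_1 \le W_{1/\alpha}$ and the following fact:

$$W_{1/\alpha}(\rho^n_t,\rho^n_s) \le \int_s^t |(\rho^n_z)'|_{W_{1/\alpha}}\, dz \le \int_s^t \|v^n_z\|_{L^{1/\alpha}(\rho^n_z)}\, dz;$$

then, applying inequality (3.1), we have in the end

$$\int_\Omega \chi_M(x)\, d\rho^n_t(x) \le \int_\Omega \chi_M(x)\, d\rho^n_a(x) + CM(b-a) \quad \text{for every } t \in [a,b].$$

In this way we have

$$\int_a^b \Big( \int \chi_M\, d\rho^n_t \Big)^{\alpha-1} \Big( \int \chi_M\, d|q^n_t| \Big)\, dt \ge \Big( \int \chi_M\, d\rho^n_a + CM(b-a) \Big)^{\alpha-1} \int_a^b \Big( \int \chi_M\, d|q^n_t| \Big)\, dt = \Big( \int \chi_M\, d\rho^n_a + CM(b-a) \Big)^{\alpha-1} \int \varphi^{1/\alpha}\, d|q^n|.$$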

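Hence, we may go on with

$$\int \varphi\, dm \ge \limsup_{n\to\infty} \Big( \int \chi_M\, d\rho^n_a + CM(b-a) \Big)^{\alpha-1} \int \varphi^{1/\alpha}\, d|q^n| \ge \Big( \int \chi_M\, d\rho_a + CM(b-a) \Big)^{\alpha-1} \int \varphi^{1/\alpha}\, d|q|.$$

In the last inequality the second factor has been dealt with in the following way: suppose $|q^n| \rightharpoonup \sigma$,
then we have $\sigma \ge |q|$; moreover $\varphi \ge \tilde\varphi$, where $\tilde\varphi(t,x) := \chi_M(x)^\alpha\, 1_{(a,b)}(t)$, and this last function is l.s.c.
and nonnegative, so that

$$\liminf_{n\to\infty} \int \varphi^{1/\alpha}\, d|q^n| \ge \liminf_{n\to\infty} \int \tilde\varphi^{1/\alpha}\, d|q^n| \ge \int \tilde\varphi^{1/\alpha}\, d\sigma \ge \int \tilde\varphi^{1/\alpha}\, d|q| = \int \varphi^{1/\alpha}\, d|q|,$$

since the boundaries $t = a$ and $t = b$ are negligible for $|q|$.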

After that, we can divide by $(b-a)$ (keeping $M$ fixed for a while) and pass to the limit as $b \to a$.
This gives, for a.e. $a \in [0,1]$,

$$\int_\Omega \chi_M(x)^\alpha\, dm_a(x) \ge \Big( \int_\Omega \chi_M(x)\, d\rho_a(x) \Big)^{\alpha-1} \int_\Omega \chi_M(x)\, d|q_a|(x).$$

We now let $M \to \infty$, so that $\chi_M$ monotonically converges to the characteristic function of the set
$Q$, and we have, by dominated convergence w.r.t. $m_a$, $\rho_a$ and $|q_a|$,

$$m_a(Q) \ge \rho_a(Q)^{\alpha-1}\, |q_a|(Q). \tag{3.3}$$

In the last term the convention $0 \cdot \infty = 0$ is used (if $|q_a|(Q) = 0$). This inequality is proven for closed
sets, but by regularity of the measures it is not difficult to prove it for arbitrary sets. Actually, if
$S \subset \Omega$ is an arbitrary Borel set, we can write

$$m_a(S) \ge m_a(Q) \ge \rho_a(Q)^{\alpha-1}\, |q_a|(Q) \ge \rho_a(S)^{\alpha-1}\, |q_a|(Q)$$

for every closed $Q \subset S$, and take a sequence of closed sets $Q_k$ such that $|q_a|(Q_k) \to |q_a|(S)$, since
$|q_a|$ is, for a.e. $a \in [0,1]$, a finite (and hence regular) measure on the compact set $\Omega$. We now want
to prove that

• $q \ll \rho$,

• $q_a = v_a \cdot \rho_a$ is atomic for $\mathcal{L}^1$-a.e. $a \in [0,1]$ (i.e. $\rho_a$ is atomic on $\{v_a \ne 0\}$),

• $m_a(\Omega) \ge F(\rho_a,q_a)$ for $\mathcal{L}^1$-a.e. $a \in [0,1]$.

This would conclude the proof.

The first statement follows from the inequality (3.1) and the behavior of the Benamou-Brenier
functional $\mathcal{A}_{1/\alpha}(\rho^n,q^n)$ as in the proof of Theorem 2.4, which guarantees an $L^{1/\alpha}$ density of $q$ w.r.t.
$\rho$. As a consequence, since $\rho$ is a measure on $[0,1]\times\Omega$ which disintegrates w.r.t. the Lebesgue
measure on $[0,1]$, the same will be true for $q$ and we can write $q_t = v_t \cdot \rho_t$.

For the second statement, take the inequality $m_a(S) \ge \rho_a(S)^{\alpha-1}\, |q_a|(S)$, which is valid for any
Borel set $S$, and apply it to sets which are contained in the Borel set $V_\varepsilon := \{x \in \Omega : |v_a(x)| > \varepsilon\}$.
For those sets, we easily have $m_a(S) \ge \varepsilon\, \rho_a(S)^\alpha$. This means that the measure $\lambda := \varepsilon^{1/\alpha}\, \rho_a \llcorner V_\varepsilon$
satisfies the inequality $\lambda(S)^\alpha \le m_a(S)$ for every Borel set $S \subset V_\varepsilon$. Since $m_a$ is a finite measure, this
implies that $\lambda$ is atomic (see Lemma 3.1 below). If the same is performed for every $\varepsilon = 1/k$, this
proves that $\rho_a$ is purely atomic on the set $\{x : |v_a(x)| \ne 0\}$, that is, $q_a = v_a \cdot \rho_a$ is purely atomic.

Once we know that $q_a$ is atomic we infer that $F(\rho_a,q_a) = \sum_i \rho_a(\{x_i\})^{\alpha-1}\, |q_a|(\{x_i\})$ and we only
need to consider $Q = \{x_i\}$ in (3.3) and add up:

$$m_a(\Omega) \ge \sum_i m_a(\{x_i\}) \ge F(\rho_a,q_a),$$

which finally concludes the proof.

Lemma 3.1. Take two finite positive measures $\lambda$ and $\mu$ on a domain $\Omega$, and $\alpha \in (0,1)$. Suppose
that the inequality $\lambda(S)^\alpha \le \mu(S)$ is satisfied for every Borel set $S \subset \Omega$. Then $\lambda$ is purely atomic.

Proof. Consider a regular grid on $\Omega$ of step $1/k$, for $k \in \mathbb{N}$, and build a measure $\lambda_k$ by putting, in
every cell of the grid, all the mass of $\lambda$ in a single point of the cell. This measure $\lambda_k$ is atomic and
we have

$$G_\alpha(\lambda_k) = \sum_i \lambda(S_i)^\alpha \le \sum_i \mu(S_i) = \mu(\Omega),$$

where the $S_i$ are the cells of the grid. If we let $k$ go to $\infty$, the step of the grid goes to zero and we
obviously have $\lambda_k \rightharpoonup \lambda$. On the other hand, the functional $G_\alpha$ is lower semicontinuous (see [8]) and
this implies $G_\alpha(\lambda) \le \liminf_{k\to\infty} G_\alpha(\lambda_k) \le \mu(\Omega) < +\infty$. In particular, $\lambda$ is atomic, thus proving
the assertion. □
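The grid construction in the proof can be tried out on the unit interval. The following sketch is an illustration under our own assumptions: $\Omega = [0,1)$, cells $[i/k,(i+1)/k)$, and hypothetical helper names. It concentrates the mass of each cell into one atom and evaluates $G_\alpha$ on the result. For a finitely atomic measure the value stabilizes as soon as the atoms fall into distinct cells, whereas for the uniform measure each of the $k$ cells carries mass $1/k$, so that $G_\alpha(\lambda_k) = k^{1-\alpha}$ is unbounded, consistently with the fact that a bound $G_\alpha(\lambda_k) \le \mu(\Omega)$ forces $\lambda$ to be atomic.

```python
def G_alpha(masses, alpha):
    """G_alpha of a finitely atomic measure given by the masses of its atoms."""
    return sum(m ** alpha for m in masses if m > 0)

def grid_concentration(atoms, k):
    """Cell masses of an atomic measure on [0, 1) for a grid of step 1/k.

    atoms is a list of (position, mass); the returned list contains the
    total mass of every non-empty cell, i.e. the atoms of lambda_k.
    """
    cells = {}
    for x, m in atoms:
        idx = min(int(x * k), k - 1)  # index of the cell containing x
        cells[idx] = cells.get(idx, 0.0) + m
    return list(cells.values())

def uniform_cells(k):
    """Cell masses of the uniform measure on [0, 1): every cell has mass 1/k."""
    return [1.0 / k] * k
```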

3.3 Proof of Theorem 2.6

In order to prove existence, one only needs to take a minimizing sequence and apply Theorem 2.4
to get a new minimizing sequence which is $\tau$-converging: this new sequence is obtained through
reparametrization (which does not change the value of $\mathcal{F}$) and by extracting a subsequence. Since
the constraints in the problem are linear, i.e. $\rho_i = \mu_i$ for $i = 0, 1$ and the continuity equation, the
limit $(\rho,q)$ will satisfy the same constraints as well. The semicontinuity proven in Theorem 2.5
then gives the existence of a solution.

4 Equivalence with previous models 

In this section we prove the equivalence of problem (2.5) with the other formulations of
branched transport problems existing in the literature. In particular, as a reference model we will take
the one presented in [6], in which the energy is defined as

$$E_\alpha(Q) = \int_C \int_0^1 |\sigma(t)|_Q^{\alpha-1}\, |\sigma'(t)|\, dt\, dQ(\sigma),$$

where $C = C([0,1];\Omega)$, $Q$ is a probability measure over $C$ concentrated on the set $\mathrm{Lip}([0,1];\Omega)$
(a traffic plan) and, for every $x \in \Omega$, the quantity $|x|_Q$ is the multiplicity of $x$ with respect to $Q$,
defined by

$$|x|_Q = Q(\{\sigma \in C \,:\, x \in \sigma([0,1])\}).$$

Given $\mu_0,\mu_1 \in \mathcal{P}(\Omega)$, the corresponding minimum problem is then given by

$$d_\alpha(\mu_0,\mu_1) = \min_{Q \in TP(\mu_0,\mu_1)} E_\alpha(Q),$$

where $TP(\mu_0,\mu_1)$ is the set of traffic plans with prescribed time marginals at $t = 0, 1$, that is

$$TP(\mu_0,\mu_1) = \{ Q \in \mathcal{P}(C) \,:\, Q \text{ concentrated on } \mathrm{Lip}([0,1];\Omega),\ (e_i)_\# Q = \mu_i,\ i = 0, 1 \},$$

and $e_t : C \to \Omega$ is the evaluation map at time $t$, given by $e_t(\sigma) = \sigma(t)$ for every $\sigma \in C$.

Remark 4.1. We point out that this model is completely equivalent to the one developed by Xia
(see [27] for the presentation of the model and [6], Chapter 9, for the equivalence), which is based
on a relaxation procedure, starting from an energy defined on finitely atomic probability measures
$\mu_0$ and $\mu_1$. In particular, thanks to this relaxed formulation, we get that for every $\mu_0$ and $\mu_1$ there
exist two sequences $\mu_0^n$ and $\mu_1^n$ of finitely atomic probability measures, weakly converging to $\mu_0$ and
$\mu_1$ respectively, and such that

$$d_\alpha(\mu_0^n,\mu_1^n) \to d_\alpha(\mu_0,\mu_1). \tag{4.1}$$

We also need to consider a slight modification of the functional $E_\alpha$ above, introduced in [7],

$$\mathcal{C}_\alpha(Q) = \int_C \int_0^1 |(\sigma(t),t)|_Q^{\alpha-1}\, |\sigma'(t)|\, dt\, dQ(\sigma),$$

where now the synchronized multiplicity $|(x,t)|_Q$ is

$$|(x,t)|_Q = Q(\{\sigma \in C \,:\, \sigma(t) = x\}).$$

This second multiplicity accounts for the quantity of curves passing at the same time through the
same point, while the one used in the definition of $E_\alpha$ considers all the curves eventually passing
through the point: in this sense, the model corresponding to the energy $\mathcal{C}_\alpha$ is more dynamical in
spirit. As a straightforward consequence of the definition of the two multiplicities, we get

$$|\sigma(t)|_Q \ge |(\sigma(t),t)|_Q, \tag{4.2}$$

so that $E_\alpha(Q) \le \mathcal{C}_\alpha(Q)$. Concerning the comparison between the minimization of $E_\alpha$ and $\mathcal{C}_\alpha$, we
recall the following result (see [7], Theorem 5.1).

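Theorem 4.2. Let $\mu_0,\mu_1 \in \mathcal{P}(\Omega)$, with $\mu_0$ a finite sum of Dirac masses. Then for every $\alpha \in [0,1]$
we get

$$\min_{Q \in TP(\mu_0,\mu_1)} E_\alpha(Q) = \min_{Q \in TP(\mu_0,\mu_1)} \mathcal{C}_\alpha(Q).$$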
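The difference between the two multiplicities, and the resulting ordering of the two energies for a fixed traffic plan, can be seen on a toy example. The sketch below rests on simplifying assumptions of our own: curves are sampled at integer times and take values on the integer line, $Q$ is a finite weighted family of such curves, and the time integral is replaced by a sum over unit time steps using the multiplicity at the starting point of each step. All function names are hypothetical.

```python
def multiplicity(curves, weights, x):
    """|x|_Q: total weight of the curves passing through x at some time."""
    return sum(w for c, w in zip(curves, weights) if x in c)

def sync_multiplicity(curves, weights, x, t):
    """|(x,t)|_Q: total weight of the curves located at x exactly at time t."""
    return sum(w for c, w in zip(curves, weights) if c[t] == x)

def discrete_energy(curves, weights, alpha, synchronized):
    """Discrete analogue of E_alpha (synchronized=False) or of the synchronized energy."""
    total = 0.0
    for c, w in zip(curves, weights):
        for t in range(len(c) - 1):
            step = abs(c[t + 1] - c[t])
            if step == 0:
                continue  # a curve at rest does not contribute
            if synchronized:
                mult = sync_multiplicity(curves, weights, c[t], t)
            else:
                mult = multiplicity(curves, weights, c[t])
            total += w * mult ** (alpha - 1) * step
    return total
```

With two curves of weight 1/2, `[0, 1, 2]` and `[0, 0, 1]`, both pass through the point 1 but at different times, so the plain multiplicity of that point is 1 while its synchronized multiplicity at time 1 is only 1/2; accordingly the synchronized energy is the larger of the two.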

We are now in a position to state and prove a result giving the equivalence between our model
and the one relative to the energy $E_\alpha$.

Theorem 4.3. For every $\alpha \in (0,1)$ and $\mu_0,\mu_1 \in \mathcal{P}(\Omega)$, we get

$$\mathcal{B}_\alpha(\mu_0,\mu_1) = \min_{Q \in TP(\mu_0,\mu_1)} E_\alpha(Q) = d_\alpha(\mu_0,\mu_1). \tag{4.3}$$

Proof. We first prove the inequality $\mathcal{B}_\alpha(\mu_0,\mu_1) \ge d_\alpha(\mu_0,\mu_1)$. Clearly, if $\mathcal{B}_\alpha(\mu_0,\mu_1) = +\infty$ there
is nothing to prove; otherwise, take $(\rho,q)$ optimal, which implies, by the way, that $q = v \cdot \rho$ and
that $q$ is atomic. Thanks to the superposition principle (see [2], Theorem 8.2.1) we can construct a
probability measure $Q \in \mathcal{P}(C)$ such that $\rho_t = (e_t)_\# Q$ and $Q$ is concentrated on absolutely continuous
integral curves of $v$, in the sense that

$$\int_C \Big| \sigma(t) - \sigma(0) - \int_0^t v_s(\sigma(s))\, ds \Big|\, dQ(\sigma) = 0 \quad \text{for every } t \in [0,1].$$

Using this information, together with the fact that $E_\alpha \le \mathcal{C}_\alpha$, and exchanging the order of integration,
we get

$$\begin{aligned} E_\alpha(Q) \le \mathcal{C}_\alpha(Q) &= \int_C \int_0^1 |(\sigma(t),t)|_Q^{\alpha-1}\, |\sigma'(t)|\, dt\, dQ(\sigma) \\ &= \int_0^1 \int_C |(\sigma(t),t)|_Q^{\alpha-1}\, |\sigma'(t)|\, dQ(\sigma)\, dt \\ &= \int_0^1 \int_C |(\sigma(t),t)|_Q^{\alpha-1}\, |v_t(\sigma(t))|\, dQ(\sigma)\, dt \\ &= \int_0^1 \int_\Omega |(x,t)|_Q^{\alpha-1}\, |v_t(x)|\, d\rho_t(x)\, dt. \end{aligned}$$

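Then we observe that, by virtue of the fact that $\rho_t = (e_t)_\# Q$, there holds

$$|(x,t)|_Q = Q(\{\sigma \in C \,:\, \sigma(t) = x\}) = \rho_t(\{x\}),$$

so that we can rewrite the last integral as

$$\int_0^1 \int_\Omega \rho_t(\{x\})^{\alpha-1}\, |v_t(x)|\, d\rho_t(x)\, dt = \int_0^1 \int_\Omega \rho_t(\{x\})^{\alpha-1}\, d|q_t|(x)\, dt = \int_0^1 \sum_i |v_{t,i}|\, \rho_{t,i}^\alpha\, dt,$$

which then gives

$$d_\alpha(\mu_0,\mu_1) = \min_{Q \in TP(\mu_0,\mu_1)} E_\alpha(Q) \le \mathcal{F}(\rho,q) = \mathcal{B}_\alpha(\mu_0,\mu_1).$$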



In order to prove the reverse inequality, we first prove that

$$\mathcal{B}_\alpha(\mu_0,\mu_1) \le \min_{Q \in TP(\mu_0,\mu_1)} \mathcal{C}_\alpha(Q). \tag{4.4}$$

Take $Q \in TP(\mu_0,\mu_1)$ optimal for $\mathcal{C}_\alpha$; then we know that there exists a pair $(\rho,q)$ which is a solution
of the continuity equation, with $\rho_t = (e_t)_\# Q$ and $q_t = v_t \cdot \rho_t$. The velocity $v$ may be chosen as

$$v_t(x) = \int_C \sigma'(t)\, dQ^{t,x}(\sigma),$$

where $Q^{t,x}$ is the disintegration of $Q$ with respect to the evaluation function $e_t$ (see [20] for this
representation formula of the velocity field $v$). This means that each $Q^{t,x}$ is a probability measure
concentrated on the set $\{\sigma \in C : \sigma(t) = x\}$ and $Q = \int Q^{t,x}\, d\rho_t(x)$. Therefore, arguing as before,

$$\begin{aligned} \mathcal{C}_\alpha(Q) &= \int_0^1 \int_C |(\sigma(t),t)|_Q^{\alpha-1}\, |\sigma'(t)|\, dQ(\sigma)\, dt \\ &= \int_0^1 \int_\Omega |(x,t)|_Q^{\alpha-1} \Big( \int_C |\sigma'(t)|\, dQ^{t,x}(\sigma) \Big)\, d\rho_t(x)\, dt \\ &\ge \int_0^1 \int_\Omega |(x,t)|_Q^{\alpha-1}\, |v_t(x)|\, d\rho_t(x)\, dt \\ &= \int_0^1 \int_\Omega \rho_t(\{x\})^{\alpha-1}\, |v_t(x)|\, d\rho_t(x)\, dt, \end{aligned}$$

which gives the desired inequality (4.4) since, even if we do not know that $q_t$ or $\rho_t$ are atomic, we can
restrict the last integral to the set of atoms of $\rho$.

Summarizing, up to now we have shown

$$d_\alpha(\mu_0,\mu_1) \le \mathcal{B}_\alpha(\mu_0,\mu_1) \le \min_{Q \in TP(\mu_0,\mu_1)} \mathcal{C}_\alpha(Q),$$

and equality holds whenever $\mu_0$ is a finite sum of Dirac masses, thanks to Theorem 4.2. In order
to conclude, it is enough to notice that thanks to Remark 4.1 we may take two sequences $\mu_0^n$ and
$\mu_1^n$ of finitely atomic probability measures such that $\mu_0^n \rightharpoonup \mu_0$, $\mu_1^n \rightharpoonup \mu_1$ and

$$d_\alpha(\mu_0^n,\mu_1^n) \to d_\alpha(\mu_0,\mu_1),$$

thus getting

$$d_\alpha(\mu_0,\mu_1) \le \mathcal{B}_\alpha(\mu_0,\mu_1) \le \liminf_{n\to\infty} \mathcal{B}_\alpha(\mu_0^n,\mu_1^n) \le \lim_{n\to\infty} d_\alpha(\mu_0^n,\mu_1^n) = d_\alpha(\mu_0,\mu_1),$$

hence concluding the proof. □

Remark 4.4. Observe that in the previous Theorem we did not only prove the equality of the
minima, but we also provided a natural way to pass from a minimizer of our formulation à la
Benamou-Brenier to a minimizer of the traffic plans model and back. The two problems are thus
equivalent in the sense that they describe the same kind of energy and the same optimal structures
of branched transport: the simple equality of the minima (4.3) is just a consequence of this more
important fact.



Appendix: the distances da and Wi/a 

This last section is devoted to estimates between the distance da induced by the branched transport 
and the Wasserstein distances Wp. In particular, in [23] the following estimates are proven for 
a > 1 — 1/d and p > I: 

da < cw;}'^''-^^+\ 

As far as lower bounds on da are concerned, the most trivial one is da > Wi but [16], Theorem 8.1, 
also proves da > Wi/a, which is slightly better. Moreover, for scaling reasons (w.r.t. the mass) it 
is not possible to go beyond p = 1/a in this last inequality. 

In this paper we already needed to estimate some branched transport cost in terms of Wi/a 
distances and metric derivatives. In this section we prove the inequalities 

W^/a <da<C Wfj2~'^^' Va G (1 - 1/d, 1]. 

These inequalities are just a particular case of those that are already known: we restrict our
attention to $p = 1/\alpha$ in the first one and to $p \ge 1/\alpha$ in the second one (as we said, they are proven in
[16] and [23]), but the proof we provide is different and somewhat simpler.

The first inequality will be approached through the formulation of branched transport we gave
in this paper, but the main tool (i.e. inequality (3.1)) is essentially in common with [16] and [22].
What is different is the way to extend this idea to generic measures, i.e. non-atomic ones.

Theorem 4.5. For every $\mu_0,\mu_1 \in \mathcal{P}(\Omega)$ we get

\[
W_{1/\alpha}(\mu_0,\mu_1) \le d_\alpha(\mu_0,\mu_1). \tag{4.5}
\]

Proof. We first observe that, thanks to the results of the previous section, for every $\mu_0,\mu_1 \in \mathcal{P}(\Omega)$
we get

\[
d_\alpha(\mu_0,\mu_1) = \int_0^1 \left( \int_\Omega |v_t(x)|\, \rho_t(\{x\})^{\alpha} \, d\#(x) \right) dt,
\]



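for a suitable $(\rho,q)$ admissible in the formulation (2.5), with $q = \rho v$. Moreover, using once more
the inequality (3.1), the right-hand side in the previous expression can be estimated as

\[
\int_0^1 \left( \int_\Omega |v_t(x)|\, \rho_t(\{x\})^{\alpha} \, d\#(x) \right) dt \ge \int_0^1 \|v_t\|_{L^{1/\alpha}(\rho_t)} \, dt,
\]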

and finally, using the fact that $(\rho,q)$ is a solution of the continuity equation, we can infer (see [2],
Theorem 8.3.1)

\[
|\rho'_t|_{W_{1/\alpha}} \le \|v_t\|_{L^{1/\alpha}(\rho_t)}, \qquad \text{for } \mathcal{L}^1\text{-a.e. } t \in [0,1],
\]

so that

\[
d_\alpha(\mu_0,\mu_1) \ge \int_0^1 |\rho'_t|_{W_{1/\alpha}} \, dt \ge W_{1/\alpha}(\mu_0,\mu_1),
\]

where in the last inequality we just estimated the length of a curve by the distance between its
endpoints. Thus we have obtained (4.5), concluding the proof. □
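
The pointwise estimate used in this proof can be checked numerically on atomic measures. The following Python sketch assumes that inequality (3.1) amounts to the concavity estimate $\sum_i a_i^\alpha \ge (\sum_i a_i)^\alpha$ for $a_i \ge 0$ and $\alpha \in (0,1]$ (the statement of (3.1) is not reproduced in this section, so this is an assumption). Applied to $a_i = |v_t(x_i)|^{1/\alpha} \rho_t(\{x_i\})$ it gives $\sum_i |v_t(x_i)|\,\rho_t(\{x_i\})^\alpha \ge \|v_t\|_{L^{1/\alpha}(\rho_t)}$. All function names are illustrative and do not come from the paper.

```python
import random


def branched_integrand(masses, speeds, alpha):
    """Branched-transport integrand at a fixed time: sum_i |v_i| * m_i**alpha."""
    return sum(abs(v) * m ** alpha for m, v in zip(masses, speeds))


def velocity_norm(masses, speeds, alpha):
    """L^{1/alpha}(rho) norm of v: (sum_i |v_i|**(1/alpha) * m_i)**alpha."""
    return sum(abs(v) ** (1.0 / alpha) * m for m, v in zip(masses, speeds)) ** alpha


def check_inequality(trials=1000, seed=0):
    """Test branched_integrand >= velocity_norm on random atomic measures."""
    rng = random.Random(seed)
    for _ in range(trials):
        n = rng.randint(1, 8)
        alpha = rng.uniform(0.05, 1.0)
        raw = [rng.random() + 1e-9 for _ in range(n)]
        total = sum(raw)
        masses = [r / total for r in raw]          # probability weights
        speeds = [rng.uniform(-5.0, 5.0) for _ in range(n)]
        lhs = branched_integrand(masses, speeds, alpha)
        rhs = velocity_norm(masses, speeds, alpha)
        if lhs < rhs * (1.0 - 1e-9) - 1e-12:       # small tolerance for rounding
            return False
    return True
```

For a single atom the two quantities coincide, which is consistent with the fact that $d_\alpha$ and $W_{1/\alpha}$ agree when a single Dirac mass is moved.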

In order to prove the other inequality, we first have to introduce some notation: we set
$Q = [0,1)^d$ and $Q_L = [0,L)^d$; for every $j \in \mathbb{N}$ we consider the following set of multi-indexes

\[
B_j = \{ z \in \mathbb{N}^d : \|z\|_\infty \le 2^j - 1 \},
\]

and observe that $\#(B_j) = 2^{jd}$. Then we partition the cube $Q_L$ into dyadic cubes having
edge length $L/2^j$, i.e.

\[
Q_L = \bigcup_{i=1}^{2^{jd}} Q_j^i = \bigcup_{z \in B_j} \frac{L}{2^j}\,(z + Q).
\]

For every $\mu \in \mathcal{P}(\Omega)$ whose support is contained in $Q_L$, its dyadic approximation is given by

\[
a_j(\mu) = \sum_{i=1}^{2^{jd}} \mu_j^i \, \delta_{x_j^i},
\]

where $\mu_j^i = \mu(Q_j^i)$ and $x_j^i$ is the center of $Q_j^i$. We shall always assume that $\Omega \subset Q_L$ for a suitable
$L$; then the following estimate is well known (Proposition 6.6, [6]).
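
For a finitely atomic measure the dyadic approximation can be computed directly. The following Python sketch is only an illustration under stated assumptions: the measure is given as a list of (point, mass) pairs with points in $[0,L)^d$, and the function name `dyadic_approximation` is hypothetical.

```python
from collections import defaultdict


def dyadic_approximation(atoms, j, L=1.0):
    """Dyadic approximation a_j(mu) of an atomic measure.

    atoms: list of (point, mass), each point a tuple of coordinates in [0, L).
    Returns a dict mapping the center of each charged dyadic cube of edge
    L / 2**j to the total mass mu(Q) of that cube.
    """
    edge = L / 2 ** j
    approx = defaultdict(float)
    for point, mass in atoms:
        # Multi-index z of the cube containing the point, clamped to B_j.
        z = tuple(min(int(x // edge), 2 ** j - 1) for x in point)
        center = tuple((zi + 0.5) * edge for zi in z)
        approx[center] += mass
    return dict(approx)
```

With `atoms = [((0.1, 0.1), 0.5), ((0.2, 0.2), 0.25), ((0.9, 0.6), 0.25)]` and `j = 1`, the first two atoms fall in the same cube and are merged at its center, while the third one is moved to the center of the opposite cube.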



Proposition 4.6. Let $\alpha \in (1 - 1/d, 1]$, then for every $\mu \in \mathcal{P}(\Omega)$ we have

\[
d_\alpha(a_j(\mu),\mu) \le \frac{2^{(d(1-\alpha)-1)j}}{2^{1-d(1-\alpha)} - 1}\, \frac{L\sqrt{d}}{2}. \tag{4.6}
\]
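
The bound in (4.6) has the structure of the tail of a geometric series, and this can be checked numerically. The sketch below assumes the usual dyadic construction (an assumption here, since this section only cites [6] for the proof): each center of generation $n+1$ is joined to the center of its parent cube, at distance $L\sqrt{d}/2^{n+2}$, and the $2^{(n+1)d}$ masses are taken equal, which is the worst case by concavity of $m \mapsto m^\alpha$. Summing these costs over $n \ge j$ should then reproduce the closed form $\frac{2^{(d(1-\alpha)-1)j}}{2^{1-d(1-\alpha)}-1}\,\frac{L\sqrt{d}}{2}$. The function names are hypothetical.

```python
import math


def dyadic_tail_closed_form(j, d, alpha, L=1.0):
    """Closed form 2^{(d(1-alpha)-1) j} / (2^{1-d(1-alpha)} - 1) * L*sqrt(d)/2."""
    if not (1.0 - 1.0 / d < alpha <= 1.0):
        raise ValueError("need 1 - 1/d < alpha <= 1")
    theta = 1.0 - d * (1.0 - alpha)  # positive under the assumption on alpha
    return 2.0 ** (-theta * j) / (2.0 ** theta - 1.0) * L * math.sqrt(d) / 2.0


def dyadic_tail_by_levels(j, d, alpha, L=1.0, levels=200):
    """Sum over generations n >= j of the worst-case cost of one dyadic step."""
    total = 0.0
    for n in range(j, j + levels):
        # 2^{(n+1)d} equal masses m give sum m**alpha = (2^{(n+1)d})**(1-alpha).
        mass_term = 2.0 ** ((n + 1) * d * (1.0 - alpha))
        # Distance from the center of a child cube to the center of its parent.
        half_diagonal = L * math.sqrt(d) / 2.0 ** (n + 2)
        total += mass_term * half_diagonal
    return total
```

The series converges exactly when $d(1-\alpha) < 1$, i.e. when $\alpha > 1 - 1/d$, which is the range of exponents considered in this section.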



The main tool, if one wants to estimate $d_\alpha$ from above by a power of $W_p$, is to show that the
distance $d_\alpha$ between two dyadic approximations can be estimated in terms of their $W_{1/\alpha}$
distance: this is the content of the next result.



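Lemma 4.7. Let $\alpha \in (1 - 1/d, 1]$, then for every $\mu_0,\mu_1 \in \mathcal{P}(\Omega)$ we get

\[
d_\alpha(a_j(\mu_0),a_j(\mu_1)) \le C\, W_{1/\alpha}(a_j(\mu_0),a_j(\mu_1))\, 2^{jd(1-\alpha)}, \tag{4.7}
\]

with $C$ depending only on $d$ and $\alpha$.

Proof. Let us consider an optimal transport plan $\gamma_j$ between $a_j(\mu_0)$ and $a_j(\mu_1)$, for the cost $c(x,y) =
|x - y|^{1/\alpha}$, that is $\gamma_j \in \mathcal{P}(\Omega \times \Omega)$ and it is of the form

\[
\gamma_j = \sum_{i,k=1}^{2^{jd}} M_j(i,k)\, \delta_{(x_j^i,\, x_j^k)},
\]

with the $2^{jd} \times 2^{jd}$ matrix $\{M_j(i,k)\}_{i,k}$ belonging to the convex set $\mathfrak{M}$ given by

\[
\mathfrak{M} = \Big\{ \{a_{i,k}\}_{i,k} : a_{i,k} \ge 0,\ \sum_{i=1}^{2^{jd}} a_{i,k} = \mu_1(Q_j^k),\ \sum_{k=1}^{2^{jd}} a_{i,k} = \mu_0(Q_j^i) \Big\}.
\]

We know by optimality that $\{M_j(i,k)\}_{i,k}$ can be taken to belong to $\mathrm{Ext}(\mathfrak{M})$, the set of extremal
points of $\mathfrak{M}$, which consists of the so-called acyclic matrices (see [H]). They are those matrices
belonging to $\mathfrak{M}$ such that the following property holds: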

\[
\prod_{r=1}^{s} a_{i_r,k_r}\, a_{i_{r+1},k_r} = 0
\]

for every $2 \le s \le 2^{jd}$ and every set of indices $i_1 < \dots < i_s \in \{1,\dots,2^{jd}\}$, $k_1 < \dots < k_s \in
\{1,\dots,2^{jd}\}$ (the convention $i_{s+1} = i_1$ and $k_{s+1} = k_1$ is used). This implies in particular that

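\[
\#\{(i,k) : M_j(i,k) \neq 0\} \le 2 \cdot 2^{jd}, \tag{4.8}
\]

that is, $\{M_j(i,k)\}_{i,k}$ has at most $2 \cdot 2^{jd}$ non-zero entries: in other terms, this optimal transport plan
$\gamma_j$ does not move more than $2 \cdot 2^{jd}$ atoms. Setting $\ell_{i,k} = |x_j^i - x_j^k|$, we then get

\[
W_{1/\alpha}(a_j(\mu_0),a_j(\mu_1)) = \Big( \sum_{i,k=1}^{2^{jd}} M_j(i,k)\, \ell_{i,k}^{1/\alpha} \Big)^{\alpha},
\]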

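and using (4.8) and Jensen's inequality

\[
\begin{aligned}
d_\alpha(a_j(\mu_0),a_j(\mu_1)) &\le \sum_{i,k=1}^{2^{jd}} M_j(i,k)^{\alpha}\, \ell_{i,k} = \sum_{i,k=1}^{2^{jd}} \big( M_j(i,k)\, \ell_{i,k}^{1/\alpha} \big)^{\alpha} \\
&\le \Big( \sum_{i,k=1}^{2^{jd}} M_j(i,k)\, \ell_{i,k}^{1/\alpha} \Big)^{\alpha} \big( \#\{(i,k) : M_j(i,k) \neq 0\} \big)^{1-\alpha} \\
&\le C\, W_{1/\alpha}(a_j(\mu_0),a_j(\mu_1))\, 2^{jd(1-\alpha)},
\end{aligned}
\]

concluding the proof. □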
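
The Jensen step in this proof only uses the number $K$ of non-zero entries of the plan: for $b_1,\dots,b_K \ge 0$ and $\alpha \in (0,1]$ one has $\sum_r b_r^\alpha \le K^{1-\alpha} (\sum_r b_r)^\alpha$, here with $b_r = M\,\ell^{1/\alpha}$. The following Python sketch checks this numerically on a sparse plan given as a list of (mass, length) pairs for its non-zero entries; this representation and the function names are assumptions made for illustration.

```python
import random


def atomic_branched_cost(entries, alpha):
    """Upper bound for d_alpha given by the plan: sum of M**alpha * length."""
    return sum(m ** alpha * length for m, length in entries)


def jensen_upper_bound(entries, alpha):
    """(sum of M * length**(1/alpha))**alpha times K**(1-alpha), K = len(entries)."""
    w = sum(m * length ** (1.0 / alpha) for m, length in entries) ** alpha
    return w * len(entries) ** (1.0 - alpha)


def check_jensen(trials=1000, seed=0):
    """Test atomic_branched_cost <= jensen_upper_bound on random sparse plans."""
    rng = random.Random(seed)
    for _ in range(trials):
        k = rng.randint(1, 10)
        alpha = rng.uniform(0.1, 1.0)
        entries = [(rng.random() + 1e-9, rng.uniform(0.0, 2.0)) for _ in range(k)]
        lhs = atomic_branched_cost(entries, alpha)
        rhs = jensen_upper_bound(entries, alpha)
        if lhs > rhs * (1.0 + 1e-9) + 1e-12:  # small tolerance for rounding
            return False
    return True
```

Equality holds when all the products $M\,\ell^{1/\alpha}$ coincide, for instance for four entries of mass $1/4$ and unit length, so the factor $K^{1-\alpha}$ cannot be improved in this step.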

Theorem 4.8. Let $\alpha \in (1 - 1/d, 1]$, then for every $p \ge 1/\alpha$, we get

\[
d_\alpha(\mu_0,\mu_1) \le C\, W_p(\mu_0,\mu_1)^{d(\alpha-1)+1}, \tag{4.9}
\]

with a constant $C$ depending only on $d$, $\alpha$ and on the diameter of $\Omega$.

Proof. It is enough to show the validity of (4.9) for $p = 1/\alpha$, then the general case will be just a
consequence of the monotonicity property of the Wasserstein distances, i.e.

\[
W_{1/\alpha} \le W_p, \qquad \text{for every } p \ge 1/\alpha.
\]

Using the triangular inequality, (4.6) and (4.7), we get for every $j \in \mathbb{N}$

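\[
\begin{aligned}
d_\alpha(\mu_0,\mu_1) &\le d_\alpha(\mu_0,a_j(\mu_0)) + d_\alpha(a_j(\mu_0),a_j(\mu_1)) + d_\alpha(a_j(\mu_1),\mu_1) \\
&\le C\, 2^{(d(1-\alpha)-1)j} + C\, W_{1/\alpha}(a_j(\mu_0),a_j(\mu_1))\, 2^{jd(1-\alpha)},
\end{aligned}
\]

and

\[
W_{1/\alpha}(a_j(\mu_0),a_j(\mu_1)) \le C\, 2^{-j} + W_{1/\alpha}(\mu_0,\mu_1),
\]

which finally gives

\[
\begin{aligned}
d_\alpha(\mu_0,\mu_1) &\le C\, 2^{(d(1-\alpha)-1)j} + C\, W_{1/\alpha}(\mu_0,\mu_1)\, 2^{jd(1-\alpha)} \\
&= C\, 2^{(d(1-\alpha)-1)j} \big( 1 + W_{1/\alpha}(\mu_0,\mu_1)\, 2^{j} \big).
\end{aligned}
\]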

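It is now sufficient to choose the index $j$ in such a way that $2^{-j}$ is comparable to $W_{1/\alpha}(\mu_0,\mu_1)$,
which in turn yields

\[
d_\alpha(\mu_0,\mu_1) \le C\, W_{1/\alpha}(\mu_0,\mu_1)^{d(\alpha-1)+1},
\]

thus giving the thesis. □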
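
The role of the index $j$ can be illustrated numerically. The precise choice made in the proof is not legible in this section, so the Python sketch below makes an assumption: it takes the smallest $j \ge 0$ with $2^{-j} \le w$, where $w$ stands for $W_{1/\alpha}(\mu_0,\mu_1)$. For $0 < w \le 1$ this gives $w\,2^{j} < 2$ and $2^{-j} \le w$, hence $2^{(d(1-\alpha)-1)j}(1 + w\,2^{j}) \le 3\, w^{d(\alpha-1)+1}$. The function names are hypothetical.

```python
import math


def choose_level(w):
    """Smallest integer j >= 0 such that 2**(-j) <= w (j = 0 when w >= 1)."""
    if w <= 0:
        raise ValueError("w must be positive")
    return max(0, math.ceil(-math.log2(w)))


def bracket(j, w, d, alpha):
    """The quantity 2^{(d(1-alpha)-1) j} * (1 + w * 2^j) bounding d_alpha up to C."""
    return 2.0 ** ((d * (1.0 - alpha) - 1.0) * j) * (1.0 + w * 2.0 ** j)


def power_bound(w, d, alpha):
    """The target quantity 3 * w^{d(alpha-1)+1}, valid for 0 < w <= 1."""
    return 3.0 * w ** (d * (alpha - 1.0) + 1.0)
```

When $w > 1$ the same choice gives $j = 0$ and the bound $1 + w$, which is controlled by a multiple of $w^{d(\alpha-1)+1}$ only through an upper bound on $w$; this is consistent with the constant in Theorem 4.8 depending on the diameter of $\Omega$.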

Remark 4.9. As we briefly mentioned, observe that the distances $d_\alpha$ and $W_{1/\alpha}$ have exactly the
same scaling with respect to the mass.

Remark 4.10. We point out that the very same $\mu_0$ and $\mu_1$ of Example 6.19 in [6] show that the
exponent $d(\alpha - 1) + 1$ in inequality (4.9) cannot be improved.